Querying Probabilistic Neighborhoods in Spatial Data Sets Efficiently
نویسندگان
چکیده
The probability that two spatial objects establish some kind of mutual connection often depends on their proximity. To formalize this concept, we define the notion of a probabilistic neighborhood : Let P be a set of n points in R, q ∈ R a query point, dist a distance metric, and f : R → [0, 1] a monotonically decreasing function. Then, the probabilistic neighborhood N(q, f) of q with respect to f is a random subset of P and each point p ∈ P belongs to N(q, f) with probability f(dist(p, q)). Possible applications include query sampling and the simulation of probabilistic spreading phenomena, as well as other scenarios where the probability of a connection between two entities decreases with their distance. We present a fast, sublinear-time query algorithm to sample probabilistic neighborhoods from planar point sets. For certain distributions of planar P , we prove that our algorithm answers a query in O((|N(q, f)|+ √ n) logn) time with high probability. In experiments this yields a speedup over pairwise distance probing of at least one order of magnitude, even for rather small data sets with n = 10 and also for other point distributions not covered by the theoretical results.
منابع مشابه
Developing a BIM-based Spatial Ontology for Semantic Querying of 3D Property Information
With the growing dominance of complex and multi-level urban structures, current cadastral systems, which are often developed based on 2D representations, are not capable of providing unambiguous spatial information about urban properties. Therefore, the concept of 3D cadastre is proposed to support 3D digital representation of land and properties and facilitate the communication of legal owners...
متن کاملNatural Neighbor Concepts in Scattered Data Interpolation and Discrete Function Approximation
The concept of natural neighbors employs the notion of distance to define local neighborhoods in discrete data. Especially when querying and accessing large scale data, it is important to limit the amount of data that has to be processed for an answer. Because of its implicit definition on distances, the natural neighbor concept is extremely well suited to provide meaningful neighborhoods in sp...
متن کاملProbabilistic Skyline Queries over Uncertain Moving Objects
Data uncertainty inherently exists in a large number of applications due to factors such as limitations of measuring equipments, update delay, and network bandwidth. Recently, modeling and querying uncertain data have attracted considerable attention from the database community. However, how to perform advanced analysis on uncertain data remains an interesting question. In this paper, we focus ...
متن کاملSpatial analysis of distribution and access to urban services at the level of urban neighborhoods with a spatial justice approach (Case study: Commercial uses of Ardabil city)
One of the most important and urgent issues of urban planning is the equitable distribution of facilities, services and accessibility of citizens at the urban level. Economic and commercial centers, including banks and financial institutions, are one of the most important economic sectors of cities and can be sustained. Social, economic, physical, and environmental impacts of neighborhoods. The...
متن کاملProbabilistic Linkage of Persian Record with Missing Data
Extended Abstract. When the comprehensive information about a topic is scattered among two or more data sets, using only one of those data sets would lead to information loss available in other data sets. Hence, it is necessary to integrate scattered information to a comprehensive unique data set. On the other hand, sometimes we are interested in recognition of duplications in a data set. The i...
متن کامل